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Abstract 

We study the maximum weight matching problem in the semi-streaming model, and improve 
on the currently best one-pass algorithm due to Zelke (Proc. STACS '08, pages 669-680) by 
devising a deterministic approach whose performance guarantee is 4.91 + e. In addition, we 
study preemptive online algorithms, a sub-class of one-pass algorithms where we are only allowed 
to maintain a feasible matching in memory at any point in time. All known results prior to 
Zelke's belong to this sub-class. We provide a lower bound of 4.967 on the competitive ratio of 
any such deterministic algorithm, and hence show that future improvements will have to store 
in memory a set of edges which is not necessarily a feasible matching. 

1 Introduction 

The computational task of detecting maximum weight matchings is one of the most fundamental 
problems in discrete optimization, attracting plenty of attention from the operations research, 
computer science, and mathematics communities. (For a wealth of references on matching problems 
see [H].) In such settings, we are given an undirected graph G = (V, E) whose edges are associated 
with non-negative weights specified by w : E — > M + . A set of edges M C E is a matching if no 
two of the edges share a common vertex, that is, the degree of any vertex in (V,M) is at most 1. 
The weight w(M) of a matching M is defined as the combined weight of its edges, i.e., Yle&M w i e )- 
The objective is to compute a matching of maximum weight. We study this problem in two related 
computational models: the semi-streaming model and the preemptive online model. 

The semi-streaming model. Even though these settings appear to be rather simple as first 
glance, it is worth noting that matching problems have an abundance of flavors, usually depending 
on how the input is specified. In this paper, we investigate weighted matchings in the semi- 
streaming model, was first suggested by Muthukrishnan [TO]. Specifically, a graph stream is a 
sequence , ej 2 , . . . of distinct edges, where , ej 2 , . . . is an arbitrary permutation of E. When an 
algorithm is processing the stream, edges are revealed sequentially, one at a time. Letting n = \V\ 
and m = \E\, efficiency in this model is measured by the space S(n, m) a graph algorithm uses, the 
time T(n, m) it requires to process each edge, and the number of passes P(n, m) it makes over the 
input stream. The main restriction is that the space S(n,m) is limited to 0(n ■ polylog(n)) bits 
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of memory. We refer the reader to a number of recent papers |1Q|, O HI [21 [9] and to the references 
therein for a detailed literature review. 

Online graph problems. Unlike the semi-streaming model, in online problems the size of the 
underlying graph is not known in advance. The online matching problem has previously been 
modeled as follows. Edges are presented one by one to the algorithm, along with their weight. 
Once an edge is presented, we must make an irrevocable decision, whether to accept it or not. An 
edge may be accepted only if its addition to the set of previously accepted edges forms a feasible 
matching. In other words, an algorithm must keep a matching at all times, and its final output 
consists of all edges which were ever accepted. In this model, it is easy to verify that the competitive 
ratio of any (deterministic or randomized) algorithm exceeds any function of the number of vertices, 
meaning that no competitive algorithm exists. However, if all weights are equal, a greedy approach 
which accepts an edge whenever possible, has a competitive ratio of 2, which is best possible for 
deterministic algorithms [7J. 

Similarly to other online settings (such as call control problems [5]), a preemptive model can 
be defined, allowing us to remove a previously accepted edge from the current matching at any 
point in time; this event is called preemption. Nevertheless, an edge which was either rejected or 
preempted cannot be inserted to the matching later on. We point out that other types of online 
matching problems were studied as well [7J EJ EJ CQ . 

Comparison between the models. Both semi-streaming algorithms and online algorithms 
perform a single pass over the input. However, unlike semi-streaming algorithms, online algorithms 
are allowed to concurrently utilize memory for two different purposes. The first purpose is obviously 
to maintain the current solution, which must always be a feasible matching, implying that the 
memory size of this nature is bounded by the maximal size of a matching. The second purpose is to 
keep track of arbitrary information regarding the past, without any concrete bound on the size of 
memory used. Therefore, in theory, online algorithms are allowed to use much larger memory than 
is allowed in the semi-streaming model. Moreover, although this possibility is rarely used, online 
algorithms may perform exponential time computations whenever a new piece of input is revealed. 
On the other hand, a semi-streaming algorithm may re-insert an edge the current solution, even 
if it has been temporarily removed, as long as this edge was kept in memory. This extra power is 
not allowed for online (preemptive) algorithms, making them inferior in this sense in comparison 
to their semi-streaming counterparts. 

Previous work. Feigenbaum et al. [3] were the first to study matching problems under similar 
assumptions. Their main results in this context were a semi-streaming algorithm that computes 
a (3/2 — ^-approximation in 0(log(l/e)/e) passes for maximum cardinality matching in bipartite 
graphs, as well as a one-pass 6- approximation for maximum weighted matching in arbitrary graphs. 
Later on, McGregor [9] improved on these findings, to obtain performance guarantees of (1 + e) 
and (2 + e) for the maximum cardinality and maximum weight versions, respectively, being able to 
handle arbitrary graphs with only a constant number of passes (depending on 1/e). In addition, 
McGregor [9] tweaked the one-pass algorithm of Feigenbaum et al. into achieving a ratio of 5.828. 
Finally, Zelke [12J has recently attained an improved approximation factor of 5.585, which stands 
as the currently best one-pass algorithm. Note that the 6-approximation algorithm in [3] and 
the 5.828-approximation algorithm in [9] are preemptive online algorithms. On the other hand, 
the algorithm of Zelke [12] uses the notion of shadow-edges, which may be re-inserted into the 
matching, and hence it is not an online algorithm. 
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Main result I. The first contribution of this paper is to improve on the above-mentioned results, 
by devising a deterministic one-pass algorithm in the semi-streaming model, whose performance 
guarantee is 4.91 + £. In a nutshell, our approach is based on partitioning the edge set into O(logn) 
weight classes, and computing a separate maximal matching for each such class in online fashion, 
using 0(n ■ polylog(n)) memory bits overall. The crux lies in proving that the union of these 
matchings contains a single matching whose weight compares favorably to the optimal one. The 
specifics of this algorithm are presented in Section [2l 

Main result II. Our second contribution is motivated by the relation between semi-streaming 
algorithms and preemptive online algorithms, which must maintain a feasible matching at any point 
in time. To our knowledge, there are currently no lower bounds on the competitive ratio that can 
be achieved by incorporating preemption. Thus, we also provide a lower bound of 4.967 on the 
performance guarantee of any such deterministic algorithm. As a result, we show that improved 
one pass algorithms for this problem must store more than just a matching in memory. Further 
details are provided in Section [3l 

2 The Semi-Streaming Algorithm 

This section is devoted to obtaining main result I, that is, an improved one-pass algorithm for 
the weighted matching problem in the semi-streaming model. We begin by presenting a simple 
deterministic algorithm with a performance guarantee of 8. We then show how to randomize 
its parameters, still within the semi-streaming framework, and obtain an expected approximation 
ratio of 4.9108. Finally, we de-randomize the algorithm by showing how to emulate the required 
randomness using multiple copies (constant number) of the deterministic algorithm, while paying 
an additional additive factor of at most e, for any fixed e > 0. 

2.1 A simple deterministic approach 

Preliminaries. We maintain the maximum weight of any edge u> max seen so far in the input 
stream. Clearly, the maximum weight matching of the edges seen so far has weight in the interval 
[ro max , t| w max \. Note that if we disregard all edges with weight at most 2e ^ nax , the weight of the 
maximum weight matching in the resulting instance decreases by an additive term of at most 
e^max < eOPT. 

Our algorithm has a parameter 7 > 1, and a value 4> > 0. We define weight classes of edges in 
the following way. For every i £ Z, we let the class W% be the collection of edges whose weight is 
in the interval [(jyy 1 , 4>Y +1 )- We note that by our initial assumption, the weight of each edge is in 
the interval [ 2e "' max , w max ], and we say that a weight class Wi is under consideration if its weight 
interval [4>j l , 4>j t+1 ) intersects [ 2£ ^ ax , w max \. The number of classes which are under consideration 
at any point in time is 0(log 7 (j)). 

The algorithm. Our algorithm simply maintains the list of classes under consideration and 
maintains a maximal (unweighted) matching for each such class. In other words, when the value 
of w ma x changes, we delete from the memory some of these matchings, corresponding to the classes 
which stop being under consideration. Note that to maintain a maximal matching in a given 
subgraph, we only need to check if the two endpoints of the new edge are not covered by existing 
edges of the matching. 

To conclude, for every new edge e G E we proceed as follows. We first check if w(e) is greater 
than the current value of w m3iX . If so, we update w max and the list of weight classes under consid- 
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eration accordingly. Then, we find the weight class of w(e), and try to extend its corresponding 
matching, i.e., e will be added to this matching if it remains a matching after doing so. 

Note that at each point the content of the memory is the value w max and a collection of 
0(log 7 (|)) matchings, consisting of 0(ralog 7 (j)) edges overall. Therefore, our algorithm indeed 
falls in the semi-streaming model. 

At the conclusion of the input sequence, we need to return a single matching rather than 
a collection of matchings. To this end, we could compute a maximum weighted matching of the 
edges in the current memory. However, for the specific purposes of our analysis, we use the following 
faster algorithm. We sort the edges in memory in decreasing order of weight classes, such that the 
edges in Wj appear before those in Wj_i, for every i. Using this sorted list of edges, we apply 
a greedy algorithm for selecting a maximal matching, in which the current edge is added to this 
matching if it remains a matching after doing so. Then, the post-processing time needed is linear in 
the size of the memory used, that is, 0(n log 7 (^)). This concludes the presentation of the algorithm 
and its implementation as a semi-streaming algorithm. 

Analysis. For purposes of analysis, we round down the weight of each edge e such that w(e) G Wj 
to be 07*. This way, we obtain rounded edge weights. Now fix an optimal solution opt and denote 
by opt its weight, and by opt' its rounded weight. The next claim immediately follows from the 
definition of Wj. 

Lemma 2.1. opt < 70Pt'. 

As an intermediate step, we analyze an improved algorithm which keeps all weight classes. That 
is, for each i, we use Mj to denote the maximal matching of class Wj at the end of the input, and 
denote by M the solution obtained by this algorithm, if we would have applied it. Similarly, we 
denote by OPTj the set of edges in opt which belong to Wj. For every i, we define the set of vertices 
Pi, associated with W{, to be the set of endpoints of edges in Mj that are not associated with higher 
weight classes: 

Pi = {u,v | (u, v) G Mj} \ (Pj+i U Pi+2 U ■ ■ ■ ). 

For a vertex p G Pi, we define its associated weight to be 07*. For vertices which do not belong 
to any Pj, we let their associated weight be zero. We next bound the total associated weight of all 
the vertices. 

Lemma 2.2. The total associated weight of all the vertices is at most • w(M). 

Proof. Consider a vertex u G Pi and let (u, v) be the edge in Mj adjacent to u. If (u, v ) G M then 
we charge the weight associated with u to the edge (u, v). Thus, an edge e G Mj is charged at most 
twice from vertices associated with its own weight class. Otherwise, if (u, v) £ M then there must 
be some other edge e G M n Mj, for some j > i, that prevented us from adding (u,v) to M, in 
which case we charge the weight associated with u to e. Notice that u e, for otherwise, u would 
not be associated with Wj. Thus, the edge e G Mj must be of the form e = (v,x) and can only be 
charged twice from vertices in weight class i, once through v and once through x. 

To bound the ratio between w(M) and the total associated weight of the vertices, it suffices 
to bound the ratio between the weight of an edge e G M and the total associated weight of the 
vertices which are charged to e. Assume that e G Mj, then there are at most two vertices which 
are charged to e and class i for all i < j, and no vertex is associated to e and class i for i > j. 
Hence, the total associated weight of these vertices is at most 

i<j i'=0 ' 1 ' ' 
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and the claim follows since w(e) > (fry 3 . ■ 

It remains to bound opt' with respect to the total associated weight. 
Lemma 2.3. opt' is at most the total weight associated with all vertices. 

Proof. It suffices to show that for every edge e = (x, y) € OPTj the maximum of the associated 
weights of x and y is at least the rounded weight of e. Suppose that this claim does not hold, then 
x and y are not covered by M,, as otherwise their associated weight would be at least frf . Hence, 
when the algorithm considered e, we would have added e to Mj, contradicting our assumption that 
x and y are not covered by Mj. ■ 

Using the above sequence of lemmas, and recalling that we lose another e in the approximation 
ratio due to disregarding edges of weight at most 2gi " max , we obtain the following inequality: 



OPT < 7OPT' < ( 7 • 



27 
7-1 



+ e -w{M). 



(2.1) 



Therefore, we establish the following theorem. 



Theorem 2.4. Our simple deterministic algorithm has an approximation ratio 0/ (-^j + e). This 
ratio can be optimized to 8 + e by picking 7 = 2. 



The next example demonstrates that the analysis leading to Theorem 12.41 is tight. 

Example 2.5. Let k be some large enough integer and e > be sufficiently small. Consider the 
instance depicted in Figure [H where M = M& consists of a single edge (x,y) with weight 7 fc . For 
every < i < k, the matching M, consists of exactly two edg ) and (y,(3i) each of weight 7*, 

and OPTj consists of two edges (a^, a^) and (Pi, bi) each of weight r y' l+l — e. In addition, there are two 
edges (afc, x) and (bk, y) whose weight is j k+l — e. It is easy to see that each Mi is indeed maximal 
in its own weight class. Given these matchings, our greedy selection rule will output a single edge 
(x, y) with total weight j k (notice that computing a maximum weight matching in Mq U • • • U Mk 
does not help when 7 > 2). Moreover, the value of the optimal solution matches our upper bound 
up to an additive 0(e) term. 

ak bk 




Figure 1: A tight example for our deterministic algorithm. 
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2.2 Improved approximation ratio through randomization 

In what follows, we analyze a randomized variant of the deterministic algorithm which was presented 
in the previous subsection. In general, this variant sets the value of <j> to be <f> = 7^ where 5 is a 
random variable. This method is commonly referred to as randomized geometric grouping. 

Formally, let 5 be a continuous random variable which is uniformly distributed on the interval 
[0, 1). We define the weight class W f (<5) = [~f i+s ,j i+1+s ), and run the algorithm as in the previous 
subsection. Note that this algorithm uses only the partition of the edges into classes and not the 
precise values of their weights. In addition, we denote by M{5) the resulting matching obtained by 
the algorithm, and by TW(S) the total associated weight of the vertices, where for a vertex p 6 Pj 
we define its associated weig ht to be 7 i+5 (i.e., the minimal value in the interval Wi{8)). We also 
denote by OPt'(<5) the value of opt' for this particular 5. 

For any fixed value of <5, inequality (|2.ip immediately implies OPt'(5) < (^ry + s) ■ w(M(5)). 
Note that OPt'(<5) and w(M(5)) are random variables, such that for each realization of 8 the above 
inequality holds. Hence, this inequality holds also for their expected values. That is, we have 
established the following lemma where E^f-] represents expectation with respect to the random 
variable 5. 

Lemma 2.6. E s [opt'(5)} < + E ) . E s [w(M(S))]. 

We next lower bound opt in terms of E ( j[opt'((5)]. 
Lemma 2.7. ^ • E 5 [opt / (5)] > opt. 

Proof. We will show the corresponding inequality for each edge e € opt. We denote by w' s {e) the 
rounded weight of e for a specific value of 5. Then, it suffices to show that -^j- • Fi§[wg(e)] > w(e). 
Let p be an integer, and let < a < 1 be the value that satisfies w{e) = jP +a . Then, for 5 < a, 
w' 5 (e) = 7 P+5 , and for 5 > a, w' s (e) = thus the expected rounded weight of e over the 

choices of 5 is 

E s [w' s (e)} = f ^+ s dd+ f ^~ 1+s d6 = -L • ( 7 p ( 7 Q - 1) + l p ~\l ~ l a )) = w(e) .(l-l)J-, 
Jo Ja ln 7 V l) ln 7 

and the claim follows. ■ 

Combining the above two lemmas we obtain that the expected weight of the resulting solution 
is at least ( 7 e ) ' OPT - This approximation ratio is optimized for 7 ~ 3.513, where it is 
roughly (4.9108 + e). Hence, we have established the following theorem. 

Theorem 2.8. The randomized algorithm has an approximation ratio of roughly 4.9108 + e. 

2.3 Derandomization 

Prior to presenting our de-randomization, we slightly modify the randomized algorithm of the 
previous subsection. In this variation, instead of picking 5 uniformly at random from the interval 
[0, 1) we pick 5' uniformly at random from the discrete set {0, ^-}. We apply the same 

method as in the previous section where we replace 5 by 5'. Then, using Lemma 12.61 we obtain 
E 5 /[opt'(5')] < ( 7 ~?i + e) • Es>[w(M(5'))]. To extend Lemma O to this new setting, we note 
that 5' can be obtained by first picking 5 and then rounding it down to the largest number in 
{0, i, |, . . . , which is at most 6. In this way, we couple the distributions of 5 and 5' . Now 
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consider the rounded weight of an edge e in opt in the two distinct values of 5 and 5' . The 
ratio between the two rounded weight is at most 7 1 / 9 . Therefore, we establish that ^ "7 ' ' 

E^[opt'(5)] > opt. Therefore, the resulting approximation ratio of the new variation is 27 If 7 + 
£. By settinf q to be large enough (picking q = |~ log pygy ] is enough), the resulting approximation 

ratio is bounded by 27 _ 1 1 " 7 + 2e. 

De-randomizing the new variation in the semi-streaming model is straightforward. We simply 
run in parallel all q possible outcomes of the algorithm, one for each possible value of 5' , and pick 
the best solution among the q solutions we obtained. Since q is a constant (for fixed values of e), the 
resulting algorithm is still a semi-streaming algorithm whose performance guarantee is 4.9108 + 2e. 
By scaling e prior to applying the algorithm, we establish the following result. 

Theorem 2.9. For any fixed e > 0, there is a deterministic one-pass semi-streaming (4.9108 + e)- 
approximation algorithm for the weighted matching problem. This algorithm processes each input 
edge in constant time and required 0(n) time at the end of the input to compute the final output. 

3 Online Preemptive Matching 

In this section, we established the following theorem. 

Theorem 3.1. The competitive ratio of any deterministic preemptive online algorithm is at least 
1Z ~ 4.967, where 1Z is the unique real solution of the equation x 3 = 4(x 2 + x + 1). 

Recall that the algorithms of [3] and [9] can be viewed as online preemptive algorithms; their 
competitive ratios are 6 and 5.828, respectively. 

Definitions of some constants. Let C = 1Z — e for some e > and assume that a deterministic 
online algorithm achieves a competitive ratio of at most C = C — e. We construct an input graph 
iteratively, and show that after a finite number of steps, the competitive ratio is violated. 

In the construction of the input, all edge weights come from two weight sequences. The main 
weight sequence is u>x, u>2, ■ ■ ., and an additional weight function is w' 2 , w' 3 , . . .. These sequences are 
defined as follows: 

• wi = 1, and w k+i = 2^pr((C 2 + l)w k - C YaZI m) for k > 1. 

• w 'k+i = + l ) w k+l ~ Wk)- 

The first sequence is defined for k + 1 only as long as w k -i > Wk-2- As soon as w k < Wk-i, the 
sequence stops with w k +i, and the length of the sequence Wi is n = k + 1. We later show that such 
a value k must exist. Let Si = YTj=i w j (and So = 0). 

Properties of the sequences. By definition, since w' i+1 = uij+i + ^(wi+i — Wi), if w k < tOfc-i, 
then Wk+i < Wk holds as well. Note that W{ < w[ for all i < n— 1, by definition, since Wi > Wi—i, 
but w' n _ 1 < w n -\. In addition, we have the following: 

w' i+1 + Wi+i + Si-i = Cwi. 
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This equality holds for i = 1,2,. 



, n — 2 since 



C + l Wi 



+ Wi+1 + di-1 = n W i+ \ - — + Wi+1 + 



C 2C + 1 

Cwi, 



((C 2 + l) Wi - C5i_i) + 



where the first equality holds by definition of w' i+i , the second equality holds by definition of Wi+\, 
and the third one by simple algebra. In addition, 

Si-2 + Wi + Wi+l + w'i + \ = Cw[. 
The last equality holds for i = 2,3, . . . ,n — 2 since 

2C + 1 Wi 

Oi-2 + Wi + m+1 + «>t+l = + Wi H ^ Wi+l - — 

= 5 i _ 2 + ^^ + ^±l-^ T ((C 2 + lK-C5 i _ 1 )) 

= (C + l)wi + 5'i_2 - Si-i 
= (C + l)wj - 
= Cto-, 

where the first equality holds by definition of w' i+1 , the second by definition of Wi + \, the third by 
simple algebra, the fourth by definition of and Sj_2, and the last one by definition of w[. 



Input construction, step 1. To better understand our construction, we advice the reader to 
consult Figure [2j The input is created in n steps. In the initial step, two edges (a±, xx) and (b\, x\), 
each of weight w\, are introduced. Assume that after both edges have arrived, the online algorithm 
holds the edge (a\,x\). All future edges either have endpoints which are new vertices, or in the set 
{a\,xi} (i.e., they do not contain b\ as an endpoint). An optimal solution keeps (&i,sci). 



Input construction, properties. Every future step can be of two distinct types, which will be 
described later on. Among the edges introduced below, vertices called bj denote endpoints which 
occur each on a single edge. 

After step i, the following invariants are maintained. The algorithm keeps a single edge denoted 
by ej. If i = 1, then ej = (a^Xj). If i > 1, then this edge can be one of two edges, (fli,Xi) or 
(ci,yi). If &i = (ai,Xi), then its weight is Wi, and an optimal solution has one edge of each weight 
wi,W2,---,Wi-i,Wi. No future edges will have common endpoints with these i edges, except, 
possibly, with the endpoint x% of the edge of weight Wi (the edge of this weight which this optimal 
solution keeps is always (xi,bi)). Otherwise, = (cj,yi), and its weight is w[, in which case an 
optimal solution can have edges of weights W\, W2, ■ ■ ■ , Wi, except for one weight Wj for some j < i. 
This index j is used in the definition of the next step, and the properties of the current step. In 
addition to these i — 1 edges, the optimal solution also has the edge (cj, yi). Future edges will have 
endpoints which are new vertices, or in the set {ci,yi}. In the last case, the vertex is equal to 
the vertex Xj. The invariants clearly hold after the first step. We next define all other steps and 
show that the invariants hold for each option. 
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Figure 2: An example of five steps of the lower bound construction. The curved edges denote 
the edge kept by the online algorithm at each time. In the first two steps, the edges (xj,Oj) are 
chosen by the algorithm. In the third step (xs, 03) is not chosen by the algorithm, so (j/3, C3) arrives 
next. In the fourth (X4, 04) is not chosen by the algorithm, so (y^, C4) arrives next. In the fifth step 
(;r 5 ,a 5 ) is chosen by the algorithm, so no further edges arrive in this step. 

Input construction, step n. If i + 1 = n, the last step consists of an edge of weight w n . Let 
x n = «n-i) if e n _i = (a n _i,x n _i) and otherwise x n = c n -\. The new edge is (x n ,b n ), where b n is 
a new vertex. This edge has a common endpoint with the edge that the algorithm has. In fact, 
the algorithm has an edge of weight at least w n -\ > w n , and thus we assume that it does not 
preempt it. If the algorithm has an edge of weight w n -i, the edge (x n , b n ) does not have Xi as an 
endpoint, so adding the new edge to the optimal solution does not require the removal of any edges, 
and the profit of the optimal solution is S n . If the algorithm has an edge of weight w' n _ l , the new 
edge is (c n -i,b n ). We replace the edge (c„_i,y n _i) of the optimal solution by the new edge. In 
addition, the edge (y n -i,bj) = (xj,bj) (where j is the index such that the optimal solution before 
the modification of the current step does not have an edge of weight Wj ) is added to the optimal 
solution, since the endpoint y n -\ became free, and the endpoint bj only has degree 1. The profit of 
the optimal solution is S n again. Recall that w' n _ 1 < w n -i, and hence the algorithm earns (in both 
cases) at most w n -\. Note also that the optimal solution has value of S n and if w n < then we can 
drop the edge of this weight from the optimal solution and get a solution of value S n -±. Therefore, 
we will use S n -\ as a lower bound on the value of the optimal solution in this case. Thus we will 
show later that ff^- > C > C . 

W„-l 

Input construction, step i + 1, for i + 1 < n. We next show how to construct the edges of 
step i + 1, for the case i + 1 < n. We introduce two new edges of weight Wi + \. Let = a^, 
if ei = (ai,Xi) and otherwise x^+i = Cj. The new edges are (xj+i, h + \), and (xj+i,aj+i), where 
Oj+i and are new vertices. Both these edges have a common endpoint with the edge that the 
algorithm has, and the algorithm can either preempt the edge it has, in which case we assume 
(without loss of generality) that it now has (xj + i, aj + i), or else it keeps the previous edge. If the 
algorithm keeps the previous edge, let y%+\ = Xi, if ei = (di,Xi) and otherwise yi + \ = y%. In this 
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case a third edge, (yi+i, Cj+i), which has a weight of w' i+1 , is introduced. The vertex Cj+i is new. 

There are four cases to consider. In the first case, if the algorithm replaces the edge (aj,Xj) 
with the edge (xj+i, ctj+i) = (cij, aj+i), then an optimal solution can add the edge (xj+i, bi + \) to its 
edges, since the endpoint bj+i is new, and the endpoint cij was introduced in the previous step, in 
which the optimal solution obtained the edge (xj, bi). 

If the algorithm replaces the edge (ci,yi) with the edge (xj+i,dj+i) = (cj,aj+i), an optimal 
solution can remove the edge (q, y^) from its solution and add the two edges (xj + i, = (q, &i+i) 
and (j/j, bj) = (xj, bj) (where j is the index such that the optimal solution before the modification 
of the current step does not have an edge of weight Wj). This is possible since the endpoints 6j + i 
and bj do not have other edges, and the endpoints Cj and yi become free. 

In the last two cases, the invariants hold. For the remaining two cases note that if w[ < 
or Wi < and the algorithm has a single edge of weight w\ or Wi, respectively, then the optimal 
solution is strictly positive and the value of the algorithm is non-positive, and hence the resulting 
approximation ratio in this case is unbounded. Hence, we can assume without loss of generality 
that if the algorithm has a single edge at the end of step i, then its weight is strictly positive. 

If the algorithm does not replace the edge (aj,Xj) with the edge (xj+i,aj+i) = (aj,aj+i), we 
show that it must replace it with the edge (y%+i, Cj+i) = (xj, Cj+i). Assume that this is not the case. 
Then the profit of the algorithm is Wi and the optimal solution can omit its edge (xj, bi) and add the 
edges (xj, Cj+i) and (oj, (since all these endpoints are introduced in steps i and i + except for 
Xj, which becomes free). Thus the profit of the optimal algorithm is Si-\+Wi + \+w' i+l = C-Wi, while 
the profit of the online algorithm is toj. Thus, the algorithm must switch to the edge (xj+i, aj+i), 
and the structure of the optimal solution is according to the invariants. 

If the algorithm does not replace the edge (cj,yj) with the edge (xj+i,aj+i) = (cj,aj+i), we 
show that it must replace it with the edge (yj+i,Cj+i) = (yj,Cj+i). Assume that this is not the 
case. Then the profit of the algorithm is w[ and the optimal solution can omit its edge (ci,yi) and 
add the edges (cj,aj+i) and (yj,Cj+i) (since Cj and j/j become free, and the other two endpoints 
are introduced in step i + Thus the profit of the optimal algorithm is Si — Wj + Wi+i + w' i+1 , 
where j < i — 1 and % > 2, since t«j < Wj + \ <••• < Wi-i as i — 1 < n — 2, we get that the optimal 
profit is at least Si_2 + Wi + ^i+i + = Cw^, while the profit of the online algorithm is w^. 
Thus, the algorithm must switch to the edge (xj+i, Oj+i), and the structure of the optimal solution 
is according to the invariants. 

Bounding the competitive ratio. We next define a recursive formula for Si. By the definition 
of the sequence Wi, we have 

' S = 

< 5i = l (3.1) 

, Sk+l = C 2C+1~ 2 $k — C 2C+1 ^ — 1 

We first use this recurrence to show that if w„_i < w ra -2 then > C. To see this note that by 
assumption S'n-i — S , n ~2 < SVt-2 — S n s, hence using the recurrence formula we conclude that 

2C + 1 C 2 + 2C + 2 

S'n-i - 2S' n _2 + _^2 _ £ - \ Sn ~ X + ^2 _[_ g + x ,S ' n - 2 < °' 

that is, 

S'n-i • (C 2 + C + 1 - 2C - 1) + S„_ 2 • (C 2 + 2C + 2 - 2C 2 - 2C - 2) < 0, 

which is equivalent to (C 2 — C)S n _i — C 2 S n _2 < 0, so C(S ra _i — S ra _2) < S n _i, and we conclude 
that Cw„_i < S n _i, as we argued. Therefore, it remains to show that there is a value of n such 
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that w n -2 > w n -\. To establish this claim, it suffices to show that there is a value of j for which 
Wj < (since w\ > 0). To prove this last claim, we will show that there is a value of k such that 
Sk < 0. Finally, to show the existence of such k, we will solve the linear homogeneous recurrence 
formula, and use the explicit form of Sk to show that there is a value of k such that Sk < 0. 

To solve the recurrence formula (|3.1|) . we guess solutions of the form Sk = x k for all k, and get 
the following quadratic equation for x: 

(2C + l)x 2 - (C 2 + 2C + 2)x + (C 2 + C + 1) = 0. 

We solve this quadratic equation and get its solutions 



Xl,2 



(C 2 + 2C + 2) ± yJ{C 2 + 2C + 2) 2 - 4(2C + 1)(C 2 + C + 1) 

2(2C + 1) 

(C 2 + 2C + 2) ± VC 4 + 4C 2 + 4 + 4C 3 + 8C + AC 2 - 8C 3 - 4C 2 - 8C 2 - AC - 8C - 4 

2(2C + 1) 



{C 2 + 2C + 2) ± y/C(C 3 - 4C 2 - AC - 4) 
2(2C + 1) ' 

Note that using C < TZ, and recalling that TZ is the unique real solution of the equation x 3 = 
4(x 2 + x + 1), we conclude that C(C 3 — AC 2 — AC — A) < and hence the two solutions are complex 
numbers whose imaginary parts are not zero. Since we got two distinct solutions of x, it is known 
that the recurrence formula (|3.ip is solved by a formula of the form Si = ax\ + (3x\ where a and 
(5 are constants. We find the value of a and (3 using the conditions So = and S± = 1. So we 
get the following set of two equations: a + (3 = (corresponding to So = 0), and ax\ + (3x2 = 1 
(corresponding to S\ = 1). From the first equation we conclude that j3 = —a, and using this we 
obtain a = — - — = , 2 == Hence, the closed form solution of Si for values of C < TZ is 

xi-X2 y JC(C- i ~iC 2 -AC-A) ' 3 

as follows. 



S 



j 



2C + 1 / (C 2 + 2C + 2) + ^C(C 3 -AC 2 -AC -A) 

^C(C 3 - 4C 2 -AC -A) 1 2(2C + 1) 



2C + 1 / (C 2 + 2C + 2)- ^C(C 3 -AC 2 -AC -A) 

^C(C 3 -AC 2 -AC -A) \ 2(2C + 1) 



(3.2) 



We use the notation i = y—T, and let a = A-i. As noted above C(C — AC — AC— 4) < 0, and hence A 
is a real number. We also define r and 8 such that ( c +2C+2 ^ + 2 ^^ 1 ^ 4C 4C 4 - = r(cos(6>) + i sm(9)), 
and also ( c + 2C + 2 ) 4C 4C j) _ r ( cos (^) _ isin(#)), then we get the following formula for Sj. 

Sj = A-i - (r j (cos{9) + ism(9)) j - r j {cos(9) -ism(9)) j ) 
= A-i - [r 3 (cos(j9) + ism(j9)) — r J (cos(j9) — isin(j9))) 
= A ■ i ■ r J • 2i sin(j#) 
= -2Ar 3 ' sm(j0) . 

Note that r 3 > for all j, and hence to show that the sequence {Sj} changes its sign as we required, 
it suffices to show that the sequence {sin(j#)} changes its sign, but this last claim holds because 
< 9 < it (as the solutions x\ and X2 are not real numbers). Hence, the claim follows. 
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